The Grammatical Interpretation of Russian Inflected Forms Using a Stem Dictionary
نویسندگان
چکیده
THE NPL Russian-English automatic dictionary is organised on a stem-paradigm basis wherein there is for most nouns and adjectives a single entry for all their inflected forms and for most verbs only one or two entries. This is in contrast to the full-form type of dictionary organisation wherein each inflected form of every word has a separate entry. The decision to organise our dictionary on this basis was made so as to be able to accommodate it on the magnetic tape store available to us on the ACE digital electronic computer of our laboratory, and, further, to minimise the look-up time per word on the computer without complicating the look-up procedure too much or investing too much programming effort in its compilation. The word content of the dictionary initially is to be 15,000 words from the Harvard University Automatic Dictionary. Out dictionary will have an average of about 1.5 entries per word, whereas a fullform dictionary would have about ten times that average.
منابع مشابه
LexInfo as a Model for Creating Ontology-Based Dictionary of Russian Grammatical Forms
This article describes possibilities of using OntoLex as a model for creating an ontology of morpho-syntactic properties of the Russian language. For this purpose we analysed morpho-syntactic properties of Russian, given in LexInfo and then extended it with grammatical categories that are not represented or that are not correctly defined in LexInfo. The introduced supplements and adjustments en...
متن کاملDictionary of Abstract and Concrete Words of the Russian Language: A Methodology for Creation and Application
The paper describes the first stage of a project on creating an electronic dictionary with numerical estimates of the degree of abstractness and concreteness of Russian words. Our approach is to integrate data obtained from several different sources: text corpora, psycholinguistic experiments, published dictionaries, markers of abstractness (certain suffixes) and a translation of a similar dict...
متن کاملRomanian Lexical Data Bases: Inflected and Syllabic Forms Dictionaries
This paper presents two lexical data bases for Romanian: RoMorphoDict, a dictionary of inflected forms and RoSyllabiDict, a dictionary of syllabified inflected forms. Each data basis is available in two Unicode formats: text and XML. An entry of RoMorphoDict, in text format, contains information on inflected form, its lemma, its morpho-syntactic description and the marking of the stressed vowel...
متن کاملITRI-03-02 A large-scale inheritance-based morphological lexicon for Russian
In this paper we describe the mapping of Zaliznjak’s (1977) morphological classes into the lexical representation language DATR (Evans and Gazdar 1996). On the basis of the resulting DATR theory a set of fully inflected forms together with their associated morphosyntax can automatically be generated from the electronic version of Zaliznjak’s dictionary (Ilola and Mustajoki 1989). From this data...
متن کاملA Large-scale Inheritance-based Morphological Lexicon for Russian
In this paper we describe the mapping of Zaliznjak’s (1977) morphological classes into the lexical representation language DATR (Evans and Gazdar 1996). On the basis of the resulting DATR theory a set of fully inflected forms together with their associated morphosyntax can automatically be generated from the electronic version of Zaliznjak’s dictionary (Ilola and Mustajoki 1989). From this data...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009